2.
Int J Neural Syst ; 33(6): 2350028, 2023 May.
Article in English | MEDLINE | ID: mdl-37118909

ABSTRACT

Parkinson's disease (PD) is a neurodegenerative condition with constantly increasing prevalence rates, strongly affecting quality of life in terms of neuromotor and cognitive performance. PD symptoms include voice and speech alterations, known as hypokinetic dysarthria (HD). Unstable phonation is one of the manifestations of HD. Repetitive transcranial magnetic stimulation (rTMS) is a rehabilitative treatment that has been shown to improve some motor and non-motor symptoms of persons with PD (PwP). This study analyzed the phonation functional behavior of 18 participants (13 males, 5 females) with PD diagnosis before (one pre-stimulus) and after (four post-stimulus) evaluation sessions of rTMS treatment, to assess the extent of changes in their phonation stability. Participants were randomized 1:1 to receive either rTMS or sham stimulation. Voice recordings of a sustained vowel [a:] taken immediately before and after the treatment, and at follow-up evaluation sessions (immediately after, at six, ten, and fourteen weeks after the baseline assessment) were processed by inverse filtering to estimate a biomechanical correlate of vocal fold tension. This estimate was further band-pass filtered into EEG-related frequency bands. Log-likelihood ratios (LLRs) between pre- and post-stimulus amplitude distributions of each frequency band showed significant differences in five cases actively stimulated. Seven cases submitted to the sham protocol did not show relevant improvements in phonation instability. Conversely, four active cases did not show phonation improvements, whereas two sham cases did. The study provides early preliminary insights into the capability of phonation quality assessment by monitoring neuromechanical activity from acoustic signals in frequency bands aligned with EEG ones.


Subjects
Parkinson Disease, Male, Female, Humans, Transcranial Magnetic Stimulation/methods, Pilot Projects, Phonation, Dysarthria, Electroencephalography
4.
Front Hum Neurosci ; 15: 661392, 2021.
Article in English | MEDLINE | ID: mdl-34149380

ABSTRACT

Complex simultaneous neuropsychophysiological mechanisms are responsible for the processing of the information to be transmitted and for the neuromotor planning of the articulatory organs involved in speech. The nature of this set of mechanisms is closely linked to the clinical state of the subject. Thus, for example, in populations with neurodevelopmental deficits, these underlying neuropsychophysiological procedures are deficient and determine their phonation. Most of these cases with neurodevelopmental deficits are due to a genetic abnormality, as is the case in the population with Smith-Magenis syndrome (SMS). SMS is associated with neurodevelopmental deficits, intellectual disability, and a cohort of characteristic phenotypic features, including voice quality, which does not seem to be in line with the gender, age, and complexion of the diagnosed subject. The phonatory profile and speech features in this syndrome are dysphonia, high f0, excess vocal muscle stiffness, fluency alterations, numerous syllabic simplifications, phoneme omissions, and unintelligibility of speech. This exploratory study investigates whether the neuromotor deficits in children with SMS adversely affect phonation as compared to typically developing children without neuromotor deficits, which has not been previously determined. The authors compare the phonatory performance of a group of children with SMS (N = 12) with a healthy control group of children (N = 12) matched in age, gender, and grouped into two age ranges. The first group ranges from 5 to 7 years old, and the second group goes from 8 to 12 years old. Group differences were determined for two forms of acoustic analysis performed on repeated recordings of the sustained vowel /a/: F1 and F2 extraction and cepstral peak prominence (CPP). It is expected that the results will shed light on the underlying neuromotor aspects of phonation in the SMS population. These findings could provide evidence of the susceptibility of phonation of speech to neuromotor disturbances, regardless of their origin.

5.
Int J Neural Syst ; 30(10): 2050058, 2020 Oct.
Article in English | MEDLINE | ID: mdl-32880202

ABSTRACT

Speech is controlled by axial neuromotor systems, therefore, it is highly sensitive to the effects of neurodegenerative illnesses such as Parkinson's Disease (PD). Patients suffering from PD present important alterations in speech, which are manifested in phonation, articulation, prosody, and fluency. These alterations may be evaluated using statistical methods on features obtained from glottal, spectral, cepstral, or fractal descriptions of speech. This work introduces an evaluation paradigm based on Information Theory (IT) to differentiate the effects of PD and aging on glottal amplitude distributions. The study is conducted on a database including 48 PD patients (24 males, 24 females), 48 age-matched healthy controls (HC, 24 males, 24 females), and 48 mid-age normative subjects (NS, 24 males, 24 females). It may be concluded from the study that Hierarchical Clustering (HiCl) methods produce a clear separation between the phonation of PD patients from NS subjects (accuracy of 89.6% for both male and female subsets), but the separation between PD patients and HC subjects is less efficient (accuracy of 75.0% for the male subset and 70.8% for the female subset). Conversely, using feature selection and Support Vector Machine (SVM) classification, the differentiation between PD and HC is substantially improved (accuracy of 94.8% for the male subset and 92.8% for the female subset). This improvement was mainly boosted by feature selection, at a cost of information and generalization losses. The results point to the possibility that speech deterioration may affect HC phonation with aging, reducing its difference to PD phonation.


Subjects
Aging/physiology, Parkinson Disease/physiopathology, Phonation/physiology, Speech Disorders/physiopathology, Support Vector Machine, Aged, Differential Diagnosis, Female, Humans, Male, Parkinson Disease/complications, Speech Acoustics, Speech Disorders/etiology
6.
Int J Neural Syst ; 29(2): 1850039, 2019 Mar.
Article in English | MEDLINE | ID: mdl-30409059

ABSTRACT

Speech articulation is produced by the movements of muscles in the larynx, pharynx, mouth and face. Therefore, speech shows acoustic features, such as formants, which are directly related to the neuromotor actions of these muscles. The first two formants are strongly related to jaw and tongue muscular activity. Speech can be used as a simple and ubiquitous signal, easy to record and process, either locally or on e-Health platforms. This fact may open a wide set of applications in the study of functional grading and monitoring of neurodegenerative diseases. A relevant question, in this sense, is how far speech correlates and neuromotor actions are related. This preliminary study is intended to find answers to this question by using surface electromyographic recordings on the masseter and the acoustic kinematics related to the first formant. It is shown in the study that relevant correlations can be found between the surface electromyographic activity (dynamic muscle behavior) and the positions and first derivatives of the first formant (kinematic variables related to vertical velocity and acceleration of the joint jaw and tongue biomechanical system). As an application example, it is shown that the probability density function associated with these kinematic variables is more sensitive than classical features such as Vowel Space Area (VSA) or Formant Centralization Ratio (FCR) in characterizing neuromotor degeneration in Parkinson's Disease.
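
The abstract names VSA and FCR without defining them. As a minimal sketch, assuming the commonly used definitions (triangular vowel space area over the corner vowels /i/, /a/, /u/, and FCR as (F2u + F2a + F1i + F1u) / (F2i + F1a)), the two classical features could be computed as follows. The function names and the formant values are illustrative assumptions, not data or code from the study.

```python
def vowel_space_area(f_i, f_a, f_u):
    """Triangular vowel space area in Hz^2 (shoelace formula).

    Each argument is an (F1, F2) pair in Hz for one corner vowel.
    """
    (f1i, f2i), (f1a, f2a), (f1u, f2u) = f_i, f_a, f_u
    return 0.5 * abs(
        f1i * (f2a - f2u) + f1a * (f2u - f2i) + f1u * (f2i - f2a)
    )


def formant_centralization_ratio(f_i, f_a, f_u):
    """FCR: grows as the corner vowels move toward the center of the vowel space."""
    (f1i, f2i), (f1a, f2a), (f1u, f2u) = f_i, f_a, f_u
    return (f2u + f2a + f1i + f1u) / (f2i + f1a)


# Hypothetical formant values (Hz), chosen only to illustrate the trend.
clear = {"f_i": (300.0, 2300.0), "f_a": (750.0, 1300.0), "f_u": (350.0, 800.0)}
centralized = {"f_i": (350.0, 2000.0), "f_a": (650.0, 1300.0), "f_u": (400.0, 1000.0)}

vsa_clear = vowel_space_area(**clear)
vsa_centralized = vowel_space_area(**centralized)
fcr_clear = formant_centralization_ratio(**clear)
fcr_centralized = formant_centralization_ratio(**centralized)
```

Both features depend only on static formant positions, which is the limitation the abstract contrasts with distributions of kinematic variables.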


Subjects
Electromyography/methods, Masseter Muscle/physiology, Neurological Models, Speech Production Measurement/methods, Speech/physiology, Adult, Aged, Biomechanical Phenomena, Dysarthria/diagnosis, Dysarthria/etiology, Humans, Jaw/physiology, Middle Aged, Parkinson Disease/complications, Parkinson Disease/diagnosis, Tongue/physiology
7.
Int J Neural Syst ; 29(2): 1850037, 2019 Mar.
Article in English | MEDLINE | ID: mdl-30336711

ABSTRACT

Neurodegenerative pathologies such as Parkinson's Disease (PD) show important distortions in speech, affecting fluency, prosody, articulation and phonation. Classically, measurements based on articulation gestures altering formant positions, such as the Vowel Space Area (VSA) or the Formant Centralization Ratio (FCR), have been proposed to measure speech distortion, but these markers are based mainly on static positions of sustained vowels. The present study introduces a measurement based on the mutual information distance among probability density functions of kinematic correlates derived from formant dynamics. An absolute kinematic velocity associated with the position of the jaw and tongue articulation gestures is estimated and modeled statistically. The distribution of this feature may differentiate PD patients from normative speakers during sustained vowel emission. The study is based on a limited database of 53 male PD patients, contrasted with a very selected and stable set of eight normative speakers. In this sense, distances based on Kullback-Leibler divergence seem to be sensitive to PD articulation instability. Correlation studies show a statistically relevant relationship between information contents based on articulation instability and certain motor and nonmotor clinical scores, such as freezing of gait or sleep disorders. Remarkably, one of the statistically relevant correlations points to the time elapsed since the first diagnosis. These results stress the need to define scoring scales specifically designed for speech disability estimation and monitoring methodologies in degenerative diseases of neuromotor origin.
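
The abstract does not specify how the Kullback-Leibler distances are computed. As a rough sketch only, assuming the velocity samples are binned into histograms over a shared range and smoothed to avoid empty bins, a discrete KL divergence between two speakers' distributions could look like this. The bin count, range, smoothing constant, and sample values are all illustrative assumptions.

```python
import math


def histogram(samples, n_bins, lo, hi):
    """Count samples into n_bins equal-width bins over [lo, hi]; out-of-range values are clamped."""
    counts = [0] * n_bins
    width = (hi - lo) / n_bins
    for x in samples:
        idx = int((x - lo) / width)
        idx = min(max(idx, 0), n_bins - 1)
        counts[idx] += 1
    return counts


def to_probabilities(counts, eps=1e-9):
    """Normalize counts to a probability vector, adding eps to every bin so no bin is exactly zero."""
    total = sum(counts) + eps * len(counts)
    return [(c + eps) / total for c in counts]


def kl_divergence(p, q):
    """Discrete Kullback-Leibler divergence D(p || q) in nats."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))


# Hypothetical absolute-velocity samples (arbitrary units) for two speakers.
stable = [0.05, 0.08, 0.10, 0.07, 0.06, 0.09, 0.05, 0.08]
unstable = [0.05, 0.30, 0.55, 0.10, 0.80, 0.45, 0.20, 0.65]

p = to_probabilities(histogram(stable, n_bins=5, lo=0.0, hi=1.0))
q = to_probabilities(histogram(unstable, n_bins=5, lo=0.0, hi=1.0))
divergence = kl_divergence(p, q)
```

Note that the KL divergence is asymmetric, so D(p || q) and D(q || p) generally differ; which direction or symmetrization the study uses is not stated in the abstract.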


Subjects
Articulation Disorders/physiopathology, Biomechanical Phenomena/physiology, Parkinson Disease/diagnosis, Aged, Articulation Disorders/etiology, Datasets as Topic, Dysarthria/etiology, Dysarthria/physiopathology, Humans, Jaw/physiopathology, Male, Middle Aged, Parkinson Disease/complications, Severity of Illness Index, Tongue/physiopathology
8.
Stem Cells Int ; 2018: 8917913, 2018.
Article in English | MEDLINE | ID: mdl-29760737

ABSTRACT

BACKGROUND: Paralysis of one vocal fold leads to glottal gap and vocal fold insufficiency that has significant impact upon a patient's quality of life. Fillers have been tested to perform intracordal injections, but they do not provide perdurable results. Early data suggest that enriching fat grafts with adipose-derived regenerative cells (ADRCs) promote angiogenesis and modulate the immune response, improving graft survival. The aim of this study is to propose ADRC-enriched adipose tissue grafts as effective filler for the paralyzed vocal fold to use it for functional reconstruction of the glottal gap. METHOD: This is the first phase I-IIA clinical trial (phase I/IIA clinical trial, unicentric, randomized, controlled, and two parallel groups), to evaluate the safety of a new therapy with ADRC-enriched fat grafting (ADRC: group I) for laryngoplasty after unilateral vocal fold paralysis. Control group patients received centrifuged autologous fat (CAF: group II) grafts. Overall mean age is 52.49 ± 16.60 years. Group I (ADRC): 7 patients (3 males and 4 females), 52.28 ± 20.95 year. Group II (CAF): 7 patients (3 males and 4 females), 52.71 ± 12.59 year. RESULTS: VHI-10 test showed that preoperative mean score was 24.21 ± 8.28. Postoperative mean score was 6.71 ± 6.75. Preoperative result in group I was 21.14 ± 3.58 and postoperative result was 3.14 ± 3.53. Preoperative result for group II was 27.29 ± 10.66. Postoperative score in group II was 10.29 ± 7.52. Wilcoxon and the Student t-tests showed that the patient's self-perception of posttreatment improvement is larger when ADRCs are used. Comparing pre- and posttreatment voice quality analysis, group I showed a p = 0.053. Group II showed a p = 0.007. There would be no significant differentiation between pre- and posttreatment results. This is true for group II and limited for group I. 
CONCLUSIONS: This prospective trial demonstrates the safety and efficacy of the treatment of glottal gap defects utilizing ADRC-enriched fat grafts. This trial is registered with NCT02904824.

9.
Cognit Comput ; 10(6): 1006-1018, 2018.
Article in English | MEDLINE | ID: mdl-30595758

ABSTRACT

Hypokinetic dysarthria (HD) and freezing of gait (FOG) are both axial symptoms that occur in patients with Parkinson's disease (PD). It is assumed they have some common pathophysiological mechanisms and therefore that speech disorders in PD can predict FOG deficits within the horizon of some years. The aim of this study is to employ a complex quantitative analysis of the phonation, articulation and prosody in PD patients in order to identify the relationship between HD and FOG, and establish a mathematical model that would predict FOG deficits using acoustic analysis at baseline. We enrolled 75 PD patients who were assessed by 6 clinical scales including the Freezing of Gait Questionnaire (FOG-Q). We subsequently extracted 19 acoustic measures quantifying speech disorders in the fields of phonation, articulation and prosody. To identify the relationship between HD and FOG, we performed a partial correlation analysis. Finally, based on the selected acoustic measures, we trained regression models to predict the change in FOG during a 2-year follow-up. We identified significant correlations between FOG-Q scores and the acoustic measures based on formant frequencies (quantifying the movement of the tongue and jaw) and speech rate. Using the regression models, we were able to predict a change in particular FOG-Q scores with an error of between 7.4 and 17.0 %. This study is suggesting that FOG in patients with PD is mainly linked to improper articulation, a disturbed speech rate and to intelligibility. We have also proved that the acoustic analysis of HD at the baseline can be used as a predictor of the FOG deficit during 2 years of follow-up. This knowledge enables researchers to introduce new cognitive systems that predict gait difficulties in PD patients.

10.
Front Neuroinform ; 11: 56, 2017.
Article in English | MEDLINE | ID: mdl-28970792

ABSTRACT

Aim: The research described is intended to give a description of articulation dynamics as a correlate of the kinematic behavior of the jaw-tongue biomechanical system, encoded as a probability distribution of an absolute joint velocity. This distribution may be used in detecting and grading speech from patients affected by neurodegenerative illnesses, such as Parkinson Disease. Hypothesis: The work hypothesis is that the probability density function of the absolute joint velocity includes information on the stability of phonation when applied to sustained vowels, as well as on fluency if applied to connected speech. Methods: A dataset of sustained vowels recorded from Parkinson Disease patients is contrasted with similar recordings from normative subjects. The probability distribution of the absolute kinematic velocity of the jaw-tongue system is extracted from each utterance. A Random Least Squares Feed-Forward Network (RLSFN) has been used as a binary classifier working on the pathological and normative datasets in a leave-one-out strategy. Monte Carlo simulations have been conducted to estimate the influence of the stochastic nature of the classifier. Two datasets for each gender were tested (males and females) including 26 normative and 53 pathological subjects in the male set, and 25 normative and 38 pathological in the female set. Results: Male and female data subsets were tested in single runs, yielding equal error rates under 0.6% (Accuracy over 99.4%). Due to the stochastic nature of each experiment, Monte Carlo runs were conducted to test the reliability of the methodology. The average detection results after 200 Monte Carlo runs of a 200-hyperplane hidden-layer RLSFN are given in terms of Sensitivity (males: 0.9946, females: 0.9942), Specificity (males: 0.9944, females: 0.9941) and Accuracy (males: 0.9945, females: 0.9942). The area under the ROC curve is 0.9947 (males) and 0.9945 (females). The equal error rate is 0.0054 (males) and 0.0057 (females).
Conclusions: The proposed methodology shows that the use of highly normalized descriptors, such as the probability distribution of kinematic variables of vowel articulation stability, which has some interesting properties in terms of information theory, boosts the potential of simple yet powerful classifiers in producing quite acceptable detection results in Parkinson Disease.
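
For readers unfamiliar with the reported figures, the sketch below shows how sensitivity, specificity, and accuracy follow from the counts of a binary detector. It uses the standard definitions; the counts are hypothetical, sized only to match the male subset (53 pathological, 26 normative), and are not results from the study.

```python
def detection_metrics(tp, fn, tn, fp):
    """Return (sensitivity, specificity, accuracy) from confusion-matrix counts.

    tp/fn: pathological subjects detected / missed.
    tn/fp: normative subjects correctly accepted / wrongly flagged.
    """
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    return sensitivity, specificity, accuracy


# Hypothetical single leave-one-out run on a set of 53 pathological
# and 26 normative speakers: one pathological speaker is missed.
sensitivity, specificity, accuracy = detection_metrics(tp=52, fn=1, tn=26, fp=0)
```

Averaging such values over many randomized runs, as the abstract describes for its Monte Carlo experiments, yields non-integer-ratio figures like those reported.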

11.
Forensic Sci Int ; 270: 25-38, 2017 Jan.
Article in English | MEDLINE | ID: mdl-27912151

ABSTRACT

There is a growing consensus that hybrid approaches are necessary for successful speaker characterization in Forensic Speaker Comparison (FSC); hence this study explores the forensic potential of voice features combining source and filter characteristics. The former relate to the action of the vocal folds while the latter reflect the geometry of the speaker's vocal tract. This set of features have been extracted from pause fillers, which are long enough for robust feature estimation while spontaneous enough to be extracted from voice samples in real forensic casework. Speaker similarity was measured using standardized Euclidean Distances (ED) between pairs of speakers: 54 different-speaker (DS) comparisons, 54 same-speaker (SS) comparisons and 12 comparisons between monozygotic twins (MZ). Results revealed that the differences between DS and SS comparisons were significant in both high quality and telephone-filtered recordings, with no false rejections and limited false acceptances; this finding suggests that this set of voice features is highly speaker-dependent and therefore forensically useful. Mean ED for MZ pairs lies between the average ED for SS comparisons and DS comparisons, as expected according to the literature on twin voices. Specific cases of MZ speakers with very high ED (i.e. strong dissimilarity) are discussed in the context of sociophonetic and twin studies. A preliminary simplification of the Vocal Profile Analysis (VPA) Scheme is proposed, which enables the quantification of voice quality features in the perceptual assessment of speaker similarity, and allows for the calculation of perceptual-acoustic correlations. The adequacy of z-score normalization for this study is also discussed, as well as the relevance of heat maps for detecting the so-called phantoms in recent approaches to the biometric menagerie.
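
The abstract mentions standardized Euclidean distances and z-score normalization without giving details. A minimal sketch, assuming each feature is z-scored across all speakers before a plain Euclidean distance is taken, is shown below. The feature vectors are invented for illustration, and the choice of population standard deviation is an assumption.

```python
import math
import statistics


def zscore_columns(rows):
    """Z-score each feature (column) across all rows; constant columns map to 0."""
    columns = list(zip(*rows))
    means = [statistics.fmean(col) for col in columns]
    stds = [statistics.pstdev(col) for col in columns]
    return [
        [(v - m) / s if s > 0 else 0.0 for v, m, s in zip(row, means, stds)]
        for row in rows
    ]


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


# Hypothetical two-feature vectors: [a source feature in Hz, a filter feature in Hz].
speakers = [
    [110.0, 520.0],   # speaker A, session 1
    [112.0, 515.0],   # speaker A, session 2
    [180.0, 640.0],   # speaker B
]
z = zscore_columns(speakers)
same_speaker_distance = euclidean(z[0], z[1])
different_speaker_distance = euclidean(z[0], z[2])
```

Standardizing first keeps features with large numeric ranges from dominating the distance, which is one reason the adequacy of z-score normalization is worth discussing for a mixed source and filter feature set.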


Subjects
Statistical Models, Speech Acoustics, Voice, Forensic Sciences, Humans, Phonetics, Monozygotic Twins, Voice Quality
12.
Article in English | MEDLINE | ID: mdl-26442245

ABSTRACT

Person identification, especially in critical environments, has always been a subject of great interest. However, it has gained a new dimension in a world threatened by a new kind of terrorism that uses social networks (e.g., YouTube) to broadcast its message. In this new scenario, classical identification methods (such as fingerprints or face recognition) have been forcedly replaced by alternative biometric characteristics such as voice, as sometimes this is the only feature available. The present study benefits from the advances achieved during last years in understanding and modeling voice production. The paper hypothesizes that a gender-dependent characterization of speakers combined with the use of a set of features derived from the components, resulting from the deconstruction of the voice into its glottal source and vocal tract estimates, will enhance recognition rates when compared to classical approaches. A general description about the main hypothesis and the methodology followed to extract the gender-dependent extended biometric parameters is given. Experimental validation is carried out both on a highly controlled acoustic condition database, and on a mobile phone network recorded under non-controlled acoustic conditions.

13.
J Voice ; 24(6): 667-77, 2010 Nov.
Article in English | MEDLINE | ID: mdl-20207107

ABSTRACT

A new index is introduced in this article to measure the degree of normality in speech. The proposed parameter has been shown to correlate with perceived hoarseness, giving an indication of the degree of normality. The calculation of such a parameter is based on a statistical model developed to represent normal and pathological voices. The modeling is built around Gaussian mixture models and Mel frequency cepstral coefficients. The proposed index has been named pathological likelihood index (PLI). PLI is compared with other aperiodicity features (such as jitter and shimmer), and measurements sensitive to additive noise (such as harmonics-to-noise ratio (HNR), cepstrum-based HNR, normalized noise energy, and glottal-to-noise excitation ratio). The proposed parameter proves to be a good estimator of the presence of pathology, showing lower correlation with noise, frequency, and amplitude perturbation parameters than these classical features show among themselves.


Subjects
Hoarseness/diagnosis, Likelihood Functions, Phonation, Speech Perception, Voice Quality, Fourier Analysis, Hoarseness/physiopathology, Hoarseness/psychology, Humans, Reproducibility of Results, Severity of Illness Index, Computer-Assisted Signal Processing, Sound Spectrography, Speech Acoustics, Speech Production Measurement, Time Factors
14.
J Voice ; 24(1): 47-56, 2010 Jan.
Article in English | MEDLINE | ID: mdl-19135854

ABSTRACT

This paper evaluates the capabilities of the Glottal to Noise Excitation Ratio for the screening of voice disorders. A lot of effort has been made using this parameter to evaluate voice quality, but there do not exist any studies that evaluate the discrimination capabilities of this acoustic parameter to classify between normal and pathological voices, and neither are there any previous studies that reflect the normative values that could be used for screening purposes. A set of 226 speakers (53 normal and 173 pathological) taken from a voice disorders database were used to evaluate the usefulness of this parameter for discriminating normal and pathological voices. To evaluate this parameter, the effect of the bandwidth of the Hilbert envelopes and the frequency shift have been analyzed, concluding that a good discrimination is obtained with a bandwidth of 1000 Hz and a frequency shift of 300 Hz. The results confirm that the Glottal to Noise Excitation Ratio provides reliable measurements in terms of discrimination among normal and pathological voices, comparable to other classical long-term noise measurements found in the literature, such as Normalized Noise Energy or Harmonics to Noise Ratio, so this parameter can be considered a good choice for screening purposes.


Subjects
Glottis/physiopathology, Noise, Speech Acoustics, Voice Disorders/diagnosis, Voice Disorders/physiopathology, Adult, Algorithms, Area Under Curve, Databases as Topic, Female, Humans, Male, Middle Aged, ROC Curve, Sex Characteristics, Voice, Young Adult
15.
IEEE Trans Biomed Eng ; 53(10): 1943-53, 2006 Oct.
Article in English | MEDLINE | ID: mdl-17019858

ABSTRACT

Voice diseases have been increasing dramatically in recent times due mainly to unhealthy social habits and voice abuse. These diseases must be diagnosed and treated at an early stage, especially in the case of larynx cancer. It is widely recognized that vocal and voice diseases do not necessarily cause changes in voice quality as perceived by a listener. Acoustic analysis could be a useful tool to diagnose this type of disease. Preliminary research has shown that the detection of voice alterations can be carried out by means of Gaussian mixture models and short-term mel cepstral parameters complemented by frame energy together with first and second derivatives. This paper, using the F-Ratio and Fisher's discriminant ratio, will demonstrate that the detection of voice impairments can be performed using both mel cepstral vectors and their first derivative, ignoring the second derivative.


Subjects
Computer-Assisted Diagnosis/methods, Biological Models, Sound Spectrography/methods, Speech Production Measurement/methods, Voice Disorders/diagnosis, Voice Disorders/physiopathology, Voice Quality, Computer Simulation, Humans, Statistical Models, Normal Distribution, Reproducibility of Results, Sensitivity and Specificity
16.
Med Eng Phys ; 28(3): 276-89, 2006 Apr.
Article in English | MEDLINE | ID: mdl-15950513

ABSTRACT

A PC-based integrated aid tool has been developed for the analysis and screening of pathological voices. With it the user can simultaneously record speech, electroglottographic (EGG), and videoendoscopic signals, and synchronously edit them to select the most significant segments. These multimedia data are stored on a relational database, together with a patient's personal information, anamnesis, diagnosis, visits, explorations and any other comment the specialist may wish to include. The speech and EGG waveforms are analysed by means of temporal representations and the quantitative measurements of parameters such as spectrograms, frequency and amplitude perturbation measurements, harmonic energy, noise, etc. are calculated using digital signal processing techniques, giving an idea of the degree of hoarseness and quality of the voice register. Within this framework, the system uses a standard protocol to evaluate and build complete databases of voice disorders. The target users of this system are speech and language therapists and ear nose and throat (ENT) clinicians. The application can be easily configured to cover the needs of both groups of professionals. The software has a user-friendly Windows style interface. The PC should be equipped with standard sound and video capture cards. Signals are captured using common transducers: a microphone, an electroglottograph and a fiberscope or telelaryngoscope. The clinical usefulness of the system is addressed in a comprehensive evaluation section.


Subjects
Computer-Assisted Diagnosis/methods, Laryngoscopy/methods, Computerized Medical Records Systems, Software, Sound Spectrography/methods, User-Computer Interface, Voice Disorders/diagnosis, Computer Graphics, Database Management Systems, Electroencephalography/methods, Information Storage and Retrieval/methods, Software Design, Systems Integration, Telemedicine/methods